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ABSTRACT 

Thermodynamic data are reported revealing that 
pseudouridine (4*) can stabilize RNA duplexes 
when replacing U and forming *P-A, T-G, *P-U and 
T-C pairs. Stabilization is dependent on type of 
base pair, position of T within the RNA duplex, 
and type and orientation of adjacent Watson-Crick 
pairs. NMR spectra demonstrate that for internal 
T-A, "P-G and T-U pairs, the N3 imino proton is 
hydrogen bonded to the opposite strand nucleotide 
and the N1 imino proton may also be hydrogen 
bonded. CD spectra show that general A-helix 
structure is preserved, but there is some shifting 
of peaks and changing of intensities. T has two 
hydrogen donors (N1 and N3 imino protons) and 
two hydrogen bond acceptors because the glyco- 
sidic bond is C-C rather than C-N as in uridine. 
This greater structural potential may allow T to 
behave as a kind of structurally driven universal 
base because it can enhance stability relative to U 
when paired with A, G, U or C inside a double helix. 
These structural and thermodynamic properties 
may contribute to the biological functions of 4*. 



INTRODUCTION 

Pseudouridine (5-ribosyluracil, ^) is one of the most 
abundant modified nucleotides (1). For example, it is 
found in transfer, ribosomal, small nuclear and small nu- 
cleolar RNAs (snoRNAs), and is likely important for 
structure and function. Isomerization of the glycosidic 
bond from Nl to C5 of uracil (Figure 1) enhances base 
rotation (1,2) and allows both Nl and N3 imino protons 
to serve as hydrogen bonding donors. It was postulated 
that ^I* could increase the thermodynamic stability of 
RNA duplexes due to formation of an additional 
hydrogen bond and/or better stacking (3,4). In compari- 
son to uridine, ^' favors C3'-endo conformation of the 



ribose which is correlated with anti conformational pref- 
erences of nucleobases (3). Pubhshed results demonstrate 
structural rigidity of *!* within single- and double-stranded 
RNA regions, but the basic topology of U and ^ contain- 
ing RNA is usually similar (1,3-10). In a model for the U2 
snRNA involved in splicing of pre-mRNA, however, 
<I'-induced significant changes of the architectural land- 
scape of the spliceosomal branch site of RNA compared 
with an unmodified counterpart (11,12). 

4' is found in the T^I'C loop of almost all tRNAs. For 
most tRNAs, ^ is also found in the D stem and/or in the 
anticodon stem and loop (1). ^I* contributes to stabilization 
of specific structural motifs. For example, ^ is phylogen- 
etically conserved in major splicesomal snRNAs (Ul, U2, 
U4, U5 and U6) and mostly located in regions involved in 
RNA-RNA and RNA-protein interactions essential for 
assembly of the splicesome and for the splicing process 
(12-14). The 5' end of U2 snRNA binds directly to pre- 
mRNA splice sites and is particularly rich in ^ residues 
(12). Structural studies of models for RNAs involved in 
splicing indicate that a *I'-A pair adjacent to a splice site 
stabilizes the helix foimed between U2 snRNA and pre- 
mRNA and induces bulging out of an adenosine to initiate 
splicing (4). The function of ^ in snoRNAs is unknown. >!' 
is also present in functionally important regions of ribo- 
somal RNAs (small subunit, large subunit, 5.8S and 5S), 
including the peptidyl transferase center (PTC) (1). In eu- 
karyotic large subunit rRNAs, ^I* accounts for 0.9-1.4% 
of the nucleotides (1). Positions of many 4' in rRNAs are 
very conserved, suggesting importance for structure and 
function. For example, Escherichia coli rRNA helix 69 
(region 1906-1924) and human rRNA region 3722-3740 
contain three and five conserved 4' residues, respectively 
(1,15,16). Spectral studies on a bacterial hairpin contain- 
ing hehx 69 show that *'s exhibit a range of effects on 
RNA stability and structure, depending on their location 
(5,7,8,17). ^''s placed in the single-stranded loop destabil- 
ize the hairpins whereas *!* in the helical stem increases 
stabihty. 

There is limited information concerning the influence of 
^ on structure and thermodynamic stabilities of RNAs 
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(3,5,7-9,17-19). Published data concern mostly the foUow- 
ing topics: (i) influence of ^ on stabihties, structure and 
recognition of anticodon hairpins of tRNAs (1,6,20,21); 
(ii) contribution of ^' residues to structure and stabihty of 
hairpins, e.g. helix 69 and region 3722-740 of E. coli 23S 
rRNA and Homo sapiens 28S rRNA (5,7,8,17); (hi) 
importance of conserved ^ modification in eukaryotic 
U2 snRNA for inducing changes in branch-site archi- 
tecture of pre-mRNA at 5'-splice sites (4,9,11,14,18,22); 
and (iv) sequence dependence of stabihties of *-A 
pairs (19). 

Here, we present results concerning the influence of * 
residues on thermodynamic stabilities of RNA duplexes 
containing ^I'-A, *-G, >I'-U and ^-C pairs at internal 
and terminal positions. The contribution of ^I* as 5'- and 
3'-terminal danghng ends is also reported. Additionally, 
the influence of A-U and G-C adjacent pairs on the 
thermodynamic stabihties of RNA duplexes with a 
central 4'- A or ^-G pair is reported. Circular dichroism 
(CD) spectra of >!' containing RNA duplexes were also 
measured. Thermodynamic and CD results for ^ are 
compared to those of isosequential duplexes with U 
instead of NMR spectra of imino proton regions 
of RNA duplexes containing ^I* indicate that the N3 
proton is usually hydrogen bonded in *I'-A, »1'-G and 
^-U pairs. 



MATERIALS AND METHODS 

Choice of oligonucleotide sequences 

The oligonucleotide sequences were selected to allow 
placing of ^I* at internal and terminal positions with the 
same nearest neighbor to minimize influence of 
surrounding Watson-Crick base pairs on stabihties, 
structures and interactions of ^I* pairs. Duplexes with 



4'-A, *-G, 4'-U and 4'-C pairs were compared to 
otherwise identical duplexes with ^ replaced by U. 

Synthesis and purification of oligonucleotides 

Ohgonucleotides were synthesized on an Applied 
Biosystems DNA/RNA and MerMadel2 (Bioauto- 
mation) synthesizers, using standard P-cyanoethyl 
phosphoramidite chemistry on solid support (23). 
Commercially available A, C, G, U (GenePharma) and 
^1 (ChemGenes) phosphoramidites with 2'-0-tertbutyldi- 
methylsilyl were used for synthesis of RNA. The syntheses 
of afl ohgonucleotides were performed on universal 
Unylinker synthesized for this purpose. The details of 
deprotection and purification of oligoribonucleotides 
were described previously (24,25). Thin-layer 
chromatography (TLC) purification of the oligonuc- 
leotides was carried out on Merck 60 F254 TLC plates 
with the mixture 1-propanol/aqueous ammonia/water 
= 55:35:10 (v/v/v). Purified oligonucleotides were 
characterized by mass spectrometry (MALDl TOF). 

UV melting experiments 

Ohgonucleotides were melted in buffer containing 1 M 
NaCl, 20 mM sodium cacodylate, 0.5 mM NazEDTA, 
pH 7. Ohgonucleotide single strand concentrations were 
calculated from absorbance above 80° C and single strand 
extinction coefficients were approximated by a nearest- 
neighbor model (26,27). The extinction coefficient of 4' 
nucleoside at 260 nm is ~20% smaller than for U but 
was assumed to be equal to U (28). This results in only 
1-2% error in duplex concentration. Absorbance versus 
temperature melting curves for nine concentrations of 
RNA duplexes ranging in concentration between 10~^ 
and 10~^M were measured at 260 nm with a heating rate 
of l°C/min from 0 to 90°C on a Beckman DU 640 
or JASCO V-650 spectrophotometer with a thermo- 
programmer. Melting curves were analysed and 
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thermodynamic parameters calculated from a two-state 
model with the program MeltWin 3.5, which includes 
standard deviation estimates of errors (29). For all 
sequences, except 5'UCACUGAGU/3'AGUGGCUCA, 
the AH° derived from Tp^' versus ln(CT/4) plots is 
within 15% of that derived from averaging the fits to 
individual melting curves, as expected if the two-state 
model is reasonable. For the exception, two transitions 
were observed, so the two-state approximation is not 
valid. Comparisons between duplexes used values from 
T^' versus In (Ct/4) plots fit by linear least squares. 
Standard deviations were propagated by taking the 
square root of the sum of the squares of the individual 
standard deviations. The correlations coefficients between 
AH° and AS° were 0.99995 for duplexes with and without 
^f, as calculated from Tm's (24). 

CD spectroscopy 

CD spectra of RNA duplexes were measured in triplicate 
from 205 to 350 nm at 10°C and 25°C on a JASCO 815 
spectropolarimeter. The buffer was 1 M NaCl, 20 mM 
sodium cacodylate and 0.5 mM Na2EDTA at pH 7 and 
duplex concentration was 0.12mM. The measured CD 
spectra were averaged; spectrum of the buffer subtracted 
and the result converted into molar ellipticity per 
nucleotide (As). 

Nuclear magnetic resonance 

All RNAs used for NMR measurements were dissolved in 
buffer containing 150mM NaCl, lOmM sodium 
phosphate and 0.1 mM EDTA, pH 6.8. For studies of 
exchangeable protons, the solvent was H2O/D2O (9:1, 
v/v) and for assignment of aromatic protons the solvent 
was 99.990% D2O (Sigma Aldrich). All the samples were 
annealed by heating at 90°C for 5 min, then slowly cooled 
to room temperature and stored at 4°C. 

NMR spectra were measured on a Bruker AVANCE III 
700 MHz spectrometer, equipped with a QCl CryoProbe. 
The 3 mm sample tubes were used with a final sample 
volume of 210 |il. The ID proton spectra in H2O/D2O 
(9:1, v/v) were collected with water suppression using 
excitation sculpting with gradients (30) and in D2O, with 
presaturation, from 128 scans. Assignment of resonances 
was based on homonuclear NOESY (30) and 
heteronuclear 'H-'^N HSQC spectra (31-34). Spectra 
were processed and prepared with TopSpin 3.0 Bruker 
Software. 



RESULTS 

Thermodynamic data for model RNA duplexes with ^I* 
and, in parenthesis, with * replaced by U are presented 
in Table 1 . The thermodynamic effects of adding terminal 
nucleotides were measured by extending core duplexes 5'C 
AGUCAGU/3'GUCAGUCA and 5'UCAGUCAG/3'AG 
UCAGUC (Table 2). The thermodynamic effects of 
replacing U with ^I* were measured in duplexes of the 
type 5'UCAXMYAGU/3'AGUYNXUCA, where X-Y is 
a Watson-Crick pair, M is U or ^I* and N is A, C, G or U 
(Tables 1 and 3). 



Contributions of terminal unpaired and paired W on 
thermodynamic stabilities of RNA duplexes 

Table 2 summarizes thermodynamic effects of adding 
teiTninal nucleotides U or * as a 5'-dangling end on the 
C of a CG pair. U and 4' enhanced free energy of duplex 
formation (AAG37) by similar increments of 0.78 and 
0.66kcal/mol, respectively, at 37°C. Both stability 
increments are more favorable than the O.lkcal/niol 
measured for the 5'-danghng U of (UCCGGp)2, where p 
represents a 3'-phosphate (39). Either U or 4' as a 
3'-danghng end on the G of a GC pair favors duplex 
(AG37) by 0.97kcal/mol, within experimental error of 
the 0.6kcal/mol measured for CCGGUp (40). It is 
commonly postulated that enhancement of RNA duplex 
stability by single 5'- and 3'-dangling ends are due to the 
stacking abihty of the unpaired nucleotide (40-42). Based 
on NMR structural studies it is postulated that ^' stacks 
better than U, presumably due to better rotational lability 
of the Cr-C5 glycoside bond in ^I' over the Cl'-Nl in U 
(3). However, those conclusions were formulated for ^' in 
an internal position of helical RNA. 

Larger and sequence dependent stability increments are 
measured when a 5' or 3' terminal U or ^I* is paired with A, 
G, U or C. A 5'-terminal U-A in a 5'UC/3'AG nearest- 
neighbor sequence enhances duplex stability by 1.88 kcal/ 
mol at 37°C, identical within error limits to the 1.81 kcal/ 
mol measured for the 5'-temiinal 5'>I'C/3'AG sequence 
(Table 1). A similar negligible difference is seen when U is 
replaced by * in the 3'-terminal 5'GU/3'CA sequence, 
where addition of the terminal U-A or *I'-A pairs provide 
stability increments of -2. 18 and-2.10kcal/mol, respectively. 
The increments of -1.88 and -2.18 kcal/mol for U are similar 
to increments of (-2.35 + 0.45) = -1 .90 and (-2.24 + 0.45) = - 
1.79 kcal/mol expected from a nearest-neighbor model 
involving only Watson-Crick pairs (24). The increment of- 
1.81 kcal/mol for the terminal *-A pair is similar to the value 
(-2.49 + 0.31) = -2.18 kcal/mol expected from a nearest- 
neighbor model, but the measured -2.10 kcal/mol is smaller 
than the predicted (-3.29 + 0.31) = -2.98 kcal/mol (19). 

Adding a U-G and ^'-G pair at the 5' end enhanced 
stability by 1.78 (predicted is 1.80 kcal/mol, (43)) and 
1.53 kcal/mol, respectively, which is within experimental 
error of the values for U-A and ^'-A (Table 2). Placing 
U-G and ^'-G pairs at the 3' end enhanced stabihties by 
2.44 (predicted is 2. 15 kcal/mol) and 3.22 kcal/mol, 
respectively, suggesting some extra stabilization from the 
3'-terminal *-G. Adding a U-U or >I'-U pair at the 5' end 
enhanced stabihty by 0.86 and 1.41 kcal/mol, respectively, 
similar to values of 1.05 and 1.83 kcal/mol for the 
additions at the 3' end. Here ^^-U appears more stable 
than U-U at both ends. Adding a U-C or *I'-C pair at 
the 5' end enhanced stability by 1.12 and 0.81 kcal/mol, 
respectively, which are close, considering experimental 
errors. The values at the 3' end are 1.36 and 1.52 kcal/ 
mol, respectively, also within experimental error. 

The results summarized in Table 2 reveal that in most 
cases the thermodynamic effect of substitution of ^I* for U 
at terminal positions is within experimental error. The 
exceptions are 3' terminal U-G and U-U pairs in the 
5'GU/3'CG and 5'GU/3'CU nearest-neighbor motifs, 
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Table 1. Thermodynamic parameters of duplex formation with pseudouridine'' 



Average of curve fits T^,' versus log Cx plots 



-AH° -AS° -AG37 Tm'' -AH° -AS° -AG°3/''-'' Tm'' 

(kcal/mol) (eu) (kcal/mol) (°C) (kcal/mol) (eu) (kcal/mol) (°C) 



5' 


fCAGUCAGU 


3' 


78.5 


± 


2.8 


216.8 


± 


8.5 


11.23 


± 


0.14 


3' 


GUCAGUCA 


5' 


(76.3 


± 


4.1)' 


(210.2 


± 


12.7) 


(11.06 


± 


0.21) 


5' 


UCAGUCACP 


3' 


77.1 


± 


8.3 


212.7 




25.5 


11.17 




0.44 


3' 


AGUCAGUC 


5' 


(83.8 


± 


12 31 


(234.5 


± 


38 71 


(1 1.05 


± 


0.51) 


5' 


•PCAGUCAGU 


3' 


76.0 


± 


3.4 


207.3 


± 


10.6 


11.64 


± 


0.13 


3' 


AGUCAGUCA 


5' 


(92.6 


± 


8.1) 


(257.6 


± 


24.7) 


(12.68 


± 


0.50) 


5' 


UCACPCAGU 


3' 


85.0 


± 


7.2 


233.1 


± 


21.9 


12.73 


± 


0.37 


3' 


AGUCAGUCA 


5' 


(92.6 


± 


8.1) 


(257.6 


± 


24.7) 


(12.68 


± 


0.50) 


5' 


UCACPGAGU 


3' 


105.5 


± 


8.2 


293.3 


± 


24.4 


14.51 


± 


0.32 


3' 


AGUGACUCA 


5' 


(85.4 


± 


8.5) 


(236.5 


± 


12.1) 


(12.07 


± 


0.22) 


5' 


UCAAfUAGU 


3' 


78.9 


± 


7.3 


226.8 


± 


22.9 


8.51 


± 


0.18 


3' 


AGUUAAUCA 


5' 


(83.2 


± 


9.1) 


(242.0 


± 


28.8) 


(8.16 


± 


0.25) 


5' 


UCAUfAAGU 


3' 


76.7 


± 


8.0 


220.1 


± 


25.1 


8.48 




0.19 


3' AGUAAUUCA 


5' 


(80.1 


± 


5.9) 


(232.7 


± 


19.2) 


(7.87 


± 


0.07) 


5' 


UCAGUCACP 


3' 


79.8 


± 


10.2 


218.7 




31.1 


11.97 




0.55 


3' 


AGUCAGUCA 


5' 


(92.6 


± 


8 n 


(257.6 


± 


24 71 


(12.68 


± 


0.50) 


5' 


•PCAGUCAGU 


3' 


77.7 


± 


4.5 


213.5 


± 


13.7 


11.51 


± 


0.28 


3' 


GGUCAGUCA 


5' 


(89.4 


± 


7.8) 


(248.2 


± 


23.7) 


12.42 


± 


0.46) 


5' 


UCACPCAGU 


3' 


95.8 


± 


10.3 


264.9 


± 


30.8 


13.62 


± 


0.76 


3' 


AGUCGGUCA 


5' 


(94.3 


± 


4.2) 


(263.9 


± 


12.9) 


(12.51 


± 


0.32) 


5' 


UCACPGAGU 


3''' 


82.2 


± 


6.6 


227.2 




19.9 


1 1.77 


± 


0.42 


3' 


AGUGGCUCA 


5' 


(105.9 


± 


7 0) 


(301.3 


± 


20.8) 


(12.49 


± 


0.66) 


5' 


UCAA-PUAGU 


3' 


75.6 


± 


5.7 


218.4 




18.4 


7.89 


± 


0.14 


3' 


AGUUGAUCA 


5' 


(79.5 


± 


12.1) 


(233.5 


± 


38.6) 


(7.04 


± 


0.15) 


5' 


UCAU>PAAGU 


3' 


78.8 


± 


8.5 


230.2 




27.2 


7.41 


± 


0.12 


3' AGUAGUUCA 


5' 


(71.4 


± 


1.1) 


(209.4 


± 


3.8) 


(6.41 


± 


0.14) 


5' 


UCAGUCACP 


3' 


99.7 


± 


5.6 


277.6 


± 


17.1 


13.60 


± 


0.35 


3' 


AGUCAGUCG 


5' 


(95.9 


± 


8.4) 


(267.5 


± 


25.5) 


(12.98 


± 


0.53) 


5' 


•PCAGUCAGU 


3' 


87.4 


± 


6.8 


243.4 


± 


20.8 


11.95 


± 


0.36 


3' 


UGUCAGUCA 


5' 


(78.6 


± 


6.2) 


(217.3 


± 


19 I) 


(1 1.24 




0.32) 


5' 


UCACPGAGU 


3' 


77.2 


± 


7.7 


217.9 


± 


24.0 


9.62 


± 


0.27 


3' 


AGUCUGUCA 


5' 


(75.5 


± 


9.4) 


(214.9 


± 


29.4) 


(8.83 


± 


0.31) 


5' 


UCAGUCACP 


3' 


83.4 


± 


5.1 


231.2 


± 


15.7 


11.70 


± 


0.27 


3' 


AGUCAGUCU 


5' 


(78.7 


± 


3.1) 


(218.0 


± 


9.5) 


(11.10 


± 


0.15) 


5' 


•PCAGUCAGU 


3' 


74.4 


± 


3.8 


204.1 


± 


11.9 


11.08 


± 


0.20 


3' CGUCAGUCA 


5' 


(78.4 


± 


1.1) 


(215.6 


± 


3.3) 


(11.56 


± 


0.09) 


5' 


UCACPGAGU 


3' 


69.6 


± 


5.3 


197.2 


± 


17.0 


8.41 


± 


0.10 


3' AGUCCGUCA 


5' 


(77.6 


± 


12.3) 


(223.7 


± 


38.5) 


(8.23 


± 


0.32) 


5' 


UCAGUCACP 


3' 


77.7 


± 


3.3 


214.1 


± 


10.3 


11.32 


± 


0.16 


3' 


AGUCAGUCC 


5' 


(74.9 


± 


4.2) 


(205.9 


± 


12.9) 


(11.05 




0.22) 


5' CAGUCAGU 3 
3' GUCAGUCA 5' 




71.0 


± 


3.5 


196.6 


± 


11.0 


10.04 


± 


0.14 


5' 
3' 


UCAGUCAG 3' 
AGUCAGUC 5' 


73.5 


± 


5.0 


204.5 


± 


15.6 


10.07 


± 


0.22 



56.8 


71.7 


± 


1.4 


195.8 


± 


4.3 


10.92 ± 0.06 


57.2 


(56.6) 


(75.8 


± 


5.1) 


(208.7 


± 


15.8) 


(11.04 ± 0.24) 


(56.6) 


56.9 


72.9 




5.1 


199.9 


± 


15.8 


10.93 ± 0.25 


56.9 


(54.7) 


(78.7 


± 


6.3) 


(218.5 


± 


19.6) 


(10.93 ± 0.27) 


(55.4) 


59.9 


84.3 




2.0 


233.0 


± 


6.1 


12.07 ±0.11 |12.33| 


58.8 


(59.1) 


(83.7 


± 


5.6) 


(230.7 


± 


17.3) 


(12.14 ± 0.34) {12.5} 


(59.3) 


61.4 


87.2 


± 


7.0 


239.9 


± 


21.1 


12.85 ± 0.43|13.69| 


61.2 


(59.1) 


(83.7 


± 


5.6) 


230.7 


± 


17.3) 


(12.14 ± 0.34) {12.5} 


(59.3) 


62.4 


104.9 


± 


6.2 


291.6 


± 


18.5 


14.44 ± 0.47 |13.28| 


62.3 


(58.5) 


(84.6 


± 


2.8) 


(234.1 


± 


8.4) 


(12.01 ± 0.14) {12.5} 


(58.5) 


45.0 


71.5 


± 


2.7 


203.5 


± 


8.7 


8.36 ± 0.05 |11.41i 


45.2 


(43.2) 


(80.2 


± 


8.6) 


(232.5 


± 


27.4) 


(8.09 ± 0.27) {7.9} 


(43.2) 


45.1 


74.1 


± 


4.1 


211.8 


± 


13.3 


8.42 ± 0.08 |9.37| 


45.1 


(42.3) 


(84.3 


± 


9.2) 


(246.5 


± 


29.9) 


(7.87 ± 0.17) {7.9} 


(42.0) 


59.7 


82.6 




6.8 


227.3 


± 


20.9 


12.06 ± 0.42 |12.44| 


59.3 


(59.1) 


(83.7 




5.6) 


(230.7 


± 


17.8) 


(12.14 ± 0.34) {12.5} 


(59.3) 


58.2 


83.5 


± 


2.4 


231.3 


± 


7.3 


11.79 ± 0.12 


57.8 


(58.9) 


(83.3 


± 


6.5) 


(229.6 


± 


19.7) 


(12.04 ± 0.36) {12.5} 


(59.0) 


61.8 


93.4 


± 


4.2 


257.9 


± 


12.8 


13.41 ± 0.27 


61.7 


(58.0) 


(84.9 


± 


3.5) 


(235.1 


± 


10.8) 


(12.01 ± 0.18) {11.9} 


(58.4) 


58.1 


79.5 




2.3 


218.8 


± 


7.3 


11.61 ± 0.12 


58.2 


(55.5) 


(76.8 


± 


1.8) 


(211.5 


± 


5.6) 


(11.18 ± 0.08) {11.4} 


(57.0) 


42.7 


68.4 




5.0 


195.5 




15.9 


7.82 ± 0.10 


43.0 


(39.0) 


(73.1 


± 


2.4) 


(213.2 


± 


7.8) 


(7.00 ± 0.02) {7.3} 


(39.0) 


40.5 


70.7 


± 


2.5 


204.0 


± 


8.2 


7.40 ± 0.02 


40.9 


(36.5) 


(62.9 


± 


3.2) 


(181.6 


± 


10.5) 


(6.56 ± 0.07) {6.7} 


(37.1) 


60.7 


93.1 


± 


2.6 


257.6 


± 


7.9 


1 T 1 0 1 A 1 /T 

13.18 ± 0.16 


60.9 


(59.3) 


(86.7 


± 


4.2) 


(239.6 


± 


12.7) 


(12.40 ± 0.24) {12.9} 


(59.5) 


57.5 


82.3 


± 


1.7 


227.7 


± 


5.2 


11.67 ± 0.09 


57.7 


(56.7) 


(76.7 


± 


4.0) 


(211.5 


± 


12.2) 


(11.12 ± 0.18) 


(56.7) 


49.9 


69.4 


± 


4.2 


193.7 


± 


13.2 


9.35 ± 0.13 


50.1 


(46.8) 


(65.7 


± 


2.6) 


(184.0 


± 


8.1) 


(8.58 ± 0.05) 


(47.0) 


57.5 


85.7 


± 


4.1 


238.2 


± 


12.6 


11.79 ± 0.22 


57.3 


(56.1) 


(76.9 


± 


1.2) 


212.6 


± 


3.6) 


(11.01 ± 0.05) 


(56.2) 


57.2 


74.9 


± 


3.4 


206.0 


± 


10.6 


11.07 ± 0.15 


57.0 


(58.3) 


(74.8 


± 


1.2) 


(204.5 


± 


3.7) 


(11.38 ± 0.06) 


(58.5) 


45.6 


67.2 




1.5 


189.8 


± 


4.7 


8.34 ± 0.02 


45.6 


(43.9) 


(69.8 


± 


3.1) 


(199.1 


± 


9.8) 


(8.07 ± 0.05) 


(44.0) 


57.4 


81.6 


± 


2.2 


226.0 


± 


6.8 


11.48 ± 0.11 


57.0 


(56.9) 


(80.7 


± 


1.4) 


(223.8 


± 


4.1) 


(11.32 ± 0.07) 


(56.6) 


53.1 


77.6 




4.0 


217.1 


± 


12.4 


10.26 ± 0.15 


52.7 


52.7 


71.2 


± 


3.9 


197.4 


± 


12.3 


9.96 ± 0.15 


52.7 



"Solutions are 1 M NaCl, 20 mM sodium cacodylate and 0.5 mM Na2EDTA, pH 7. 
''Calculated for lO^M total oligonucleotide strand concentration. 

°In parenthesis the thermodynamic parameters of analogs RNA duplexes with U instead of *. 

''in square brackets the free energy of duplexes calculated according to * nearest-neighbor parameters published in (19). 

''In braces free energy of unmodified RNA duplexes calculated with RNAstructure (35,43). 

"^Two transitions were observed when central pair was U-G, so the two-state approximation is not valid. 



respectively, where replacement of U with ^ enhances 
stabihty by 0.8 kcal/mol. In all but two cases, the 
terminal stability increments are also consistent with 
previous measurements. One difference is that the 5' 
dangling end U in the 5'UC/3'G motif had a stabihty 
increment of -0.78 kcal/mol compared to a previous 
single measurement of -0.1 kcal/mol. It is possible that 
the enhanced stability reflects a dynamic pairing 
ensemble of the 3'G with 5'UC relative to that possible 
with the previous measurement on the shorter duplex (UC 
CGGp)2. Another possibility is an unexpected effect from 
the 3' phosphates in (UCCGGp)2. The other case where 



the measurement is not predicted within experimental 
error is the 3' terminal 5'G4'/3'CA motif where the 
measured stability increment is -2. 10 kcal/mol compared 
to a nearest-neighbor prediction of -2.98 kcal/mol. The 
nearest-neighbor parameters for ^'-A pairs, however, are 
currently based on only five occurrences of each (19). 

Contribution of internal paired T on thermodynamic 
stabilities of RNA duplexes 

Somewhat larger effects were measured when U was 
replaced by * in *-A, *I'-G, ^-U and ^-C pairs at 
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Table 2. Thermodynamic increments for adding terminal nucleotides 



Core duplex 


Added U 
terminus 


ag;„ 

(kcal/mol) 


Added * 
terminus 


ag;„ 

(kcal/mol) 


aag;/ 

(kcal/mol) 


5' CAGUCAGU 3' 
3' GUCAGUCA 5' 


5'U 

5'U/3'A 
5'U/3'G 
5'U/3'U 
5'U/3'C 


-0.78 ± 0.28 (-0.1)'' 
-1.88 ± 0.37 (-1.9f 
-1.78 ± 0.39 (-1.8)" 
-0.86 ± 0.23 (-1.0)'' 
-1.12 ± 0.16 (-0.9)^ 


5'* 

5'*/3'A 
5'*/3'G 
5'*/3'U 
5'*/3'C 


-0.66 ± 0.16 
-1.81 ± 0.17 (-2.18)'' 
-1.53 ± 0.19 
-1.41 ± 0.17 
-0.81 ± 0.21 


+0.12 
+0.07 (0.2) 
+0.25 
-0.55 
+0.31 


5' UCAGUCAG 3' 
3' AGUCAGUC 5' 


3'U 

3'U/5'A 
3'U/5'G 
3'U/5'U 
3'U/5'C 


-0.97 ± 0.31 (-0.6)'' 
-2.18 ± 0.37 (-1.79)" 
-2.44 ± 0.28 (-2.15)" 
-1.05 ± 0.16 (1.0)'' 
-1.36 ± 0.17 (-1.2)'' 


3'* 

3'vI'/5'A 
3'vl'/5'G 
3'*/5'U 
3'*/5'C 


-0.97 ± 0.29 
-2.10 ± 0.45 (-2.98)'^ 
-3.22 ± 0.22 
-1.83 ± 0.23 
-1.52 ± 0.19 


0.00 
+0.08 (0.1) 
-0.78 
-0.78 
-0.16 



Thermodynamic values used are from Tm ' plots. 

"Differences between free energy increments from replacing U with *. 

''Predicted based on (39). 

"Predicted based on (24). 

''Predicted based on (19). 

"Predicted based on (43). 

"^Predicted based on (36). 

^Predicted based on (37). 

''Predicted based on (40). 



Table 3. Thermodynamic effect from replacing U with * at middle position" 



Sequences of the duplexes 


AAG37'' 


AAG",,'' 


AAG3,'' 


AAG37'' 




(kcal/mol) 


(kcal/mol) 


(kcal/mol) 


(kcal/mol) 




X = A 


X = G 


X = U 


X = C 


5' UCAGTCAGU 3' 
3' AGUCXGUCA 5' 


-0.71 ± 0.55 (-1.2) 


-1.40 ± 0.34 


-0.77 ± 0.14 


-0.27 ± 0.05 


5' UCACTGAGU 3' 
3' AGUGXCUCA 5' 


-2.43 ± 0.49 (-0.8) 


-0.43 ± 0.14" 






5' UCAAfUAGU 3' 
3' AGUUXAUCA 5' 


-0.27 ± 0.27 (-3.5) 


-0.82 ± 0.10 






5' UCAUTAAGU 3' 
3' AGUAXUUCA 5' 


-0.55 ± 0.19 (-1.5) 


-0.84 ± 0.07 







"Solutions are IM NaCl, 20 mM sodium cacodylate and 0.5 mM NajEDTA, pH 7. 

''Differences in measured free energies relative to analogous RNA duplexes with uridine (U) instead of pseudouridine (*); values in parenthesis are 
differences predicted on basis of nearest parameters (38) and (19); thermodynamic values used are from Tm"' plots. 
"Two transitions were observed when central pair was U-G, so two-state approximation is not valid. 



internal positions (Table 3). In most cases, replacing U 
with 'I' clearly enhanced thermodynamic stabilities (i.e. 
gave more negative AG37). 

Most enhancements in stabilities upon replacing U with 
^I* to give internal *-A, ^'-G and *-U pairs ranged from 
0.3 to 0.8 kcal/mol (Table 3). The two outhers are the 
motifs 5'C*G/3'GAC and 5'GvI/C/3'CGG where * 
substitution for U enhances duplex stability by 2.43 and 
1.40 kcal/mol, respectively. In these cases, flipping the 
Watson-Crick base pair orientation gives enhancements 
of 0.71 and ~0. 4 kcal/mol, respectively, which is within 
the usual range of AGjy's. Presumably, the outliers 
reflect favorable stacking and hydrogen bonding 
interactions of »!' relative to U. As shown below, there is 
no NMR evidence for hydrogen bonding involving the HI 
imino proton of * in the 5'G>I'C/3'CGG duplex. The 
results suggest that 3D structures and computational 
studies of the outlier duplexes might be interesting 
because little is known about interactions favoring 
stacking. 



Because nearest-neighbor parameters are available for 
U-A and ^-A pairs adjacent to Watson-Crick pairs, it is 
possible to compare the measured and predicted effects of 
replacing a middle U-A pair with ^-A (Table 3). In two 
cases, the difference between measured and predicted 
exceeds 1 kcal/mol. The measured versus predicted 
values are -2.43 versus -0.8 kcal/mol for the 5'C*G/ 
3'GAC central motif and -0.27 versus -3.5 kcal/mol for 
the 5'AW/3'UAA central motif. The 5'A*U/3'UAA 
motif occurs in the duplex predicted least weU by 
nearest-neighbor parameters (Table 1). It also contains 
the two ^^-A nearest-neighbor parameters (19) that differ 
most from their U-A (24) equivalents with ^I* 
enhancements of -1.70 and -1.81 kcal/mol. 

NMR reveals at least one hydrogen bond of pseudouridine 
pairing with A, G or U 

Pseudouridine is potentially more versatile in its hydrogen 
bonding than uridine because two imino protons are 
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available to serve as hydrogen bond donors (Figure 1). 
NMR spectroscopy was used to compare hydrogen 
bonding in a central U-A pair with that with a central ^I* 
paired with A, C, G or U (Figure 2). 

NMR of duplexes with a U-A or *P-A central pair 

The ID proton spectrum of the duplex containing a 
U5-A14 pair revealed seven imino proton resonances. 
The G and U imino protons were distinguished based 



on analysis of '^N chemical shifts in 'H-'^N HSQC 
spectra and by characteristic NOE cross-peaks to C 
amino protons for G and to AH2 protons for U. 
Assignments were made with 2D NOESY spectra (data 
not shown), which revealed typical intra- and inter- 
strand correlations. No imino resonances were detected 
for the terminal U's. Non-exchangeable protons were 
assigned with standard methods (44). The NOESY 
spectra recorded in D2O exhibited typical features of 



(g\ 123456789 
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18 1716 1514 13 12 11 10 
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U14 
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Figure 2. ID NMR spectra. Buffer is 150mM NaCl, lOmM sodium phosphate and 0.1 mM EDTA, pH 6.8. For spectra (a), (b) and (c) to (h), 
duplex concentrations were 1.5, 1.0 and 0.5 niM, respectively. 
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double-stranded sequential connectivities and are 
consistent with a duplex structure in which all nucleotides 
are involved in Watson-Crick base pairs. 

The ID proton spectrum of the *-A duplex (Figure 2) 
shows eight major imino resonances and is similar to that 
of the U-A duplex except for the appearance of a new 
signal at 10.25 ppm. This high field resonance was 
assigned to the ^sNlH imino proton and its chemical 
shift is in agreement with previous observations of 4'-A 
base pairs (8,17,45). Assignments of ^sNlH resonances 
throughout this work were based on the fact that they 
exhibit strong NOEs to upfield peaks assigned as 
*I'5CH6. The ^I'sNBH imino proton is shifted upfield by 
~0.19ppm relative to the U5NIH imino proton of the 
U-A duplex. The presence of NOE cross-peaks between 
*5N3H and G4NIH and G13NIH was helpful in their 
unambiguous assignments. The observed strong NOE 
cross correlation between ^sNSH and the A14H2 proton 
(data not shown) demonstrated that pseudouridine 
pairs with adenosine A14 in a Watson-Crick manner 
involving ^sNSH-A^Nl and 4'502-Ai4N6H hydrogen 
bonds (Figure 1). 

Spectra were similarly measured, assigned and 
interpreted for duplexes with a central ^-A in triplets 
5'C*G/3'GAC, 5'AW/3'UAA and 5'U>;'A/3'AAU 
(Figure 2). In all cases, the ^-A pair is in a Watson- 
Crick conformation. The spectra for the 5'A*I'U/3'UAA, 
and 5'U4'A/3'AAU duplexes show eight resonances (this 
includes an overlap of U15N3H and 4'5N3H for 5'AW/ 
3'UAA), with the *5N1H resonance upfield between 10 
and 1 1 ppm. The spectrum of the 5'C>I'G/3'GAC duplex, 
however, has two resonances between 10 and 11 ppm 
suggesting some conformational equihbrium involving 
^I'sNlH. The appearance and upfield chemical shift of 
the ^'sNlH is consistent with those protons being partially 
protected from exchange with water and potentially in a 
weak hydrogen bond. 

NMR of duplexes with a V-G, W-U or f -C central pair 

The ID proton spectrum of the 4'-G duplex (Figure 2) 
shows eight imino proton resonances because the 
resonances of G4 and U12 are overlapped at 13.50 ppm. 
A strong imino-imino cross-peak was observed between 
resonances at 1 1.99 and 1 1.85 ppm and the chemical shifts 
were attributed to >I'5N3H and G14NIH, respectively. This 
type of interaction is distinctive of wobble pair formation 
because of the short distance between the two imino 
protons (Figure 1). 

The ID proton spectrum (Figure 2) of the *I'-U duplex 
shows eight imino proton resonances. The <I'5N3H imino 
proton was distinguished from U14N3H by observation of 
an NOE cross-peak between the amino proton of Cg and 
»I'5N3H (data not shown). A strong NOE cross-peak 
between >I'5N3H and U14N3H protons was observed 
indicating formation of a *I'-U base pair. Based on 
NOESY data alone, however, it was not possible to 
distinguish between the types of *-U base pairs shown 
in Figure 1. 

The proton spectrum of the ^-C duplex (Figure 2) 
shows only six imino proton resonances and all were 
assigned to Watson-Crick A-U and G-C pairs. No 



imino protons of ^ were detected, presumably due to 
exchange with water. Evidently, 4' and C do not 
hydrogen bond together. Nevertheless, a continuous set 
of H8/H6-Hr NOE connectivities including *5 and C14, 
were determined for both strands, so both residues stack 
into the duplex. Additionally, the presence of aromatic- 
aromatic NOE cross-peaks between G4H8/4'5H6, *I'5H6/ 
CfiHe, G13H8/C14H6 and C14H6/C15H6 support inward 
orientation of ^'5 and C14. The lack of hydrogen 
bonding in the 4'-C pair could explain why the 5'G^C/ 
3'CCG duplex is the least stable of those with a central 
5'G*C/3'CNG motif. 

CD spectra of RNA duplexes containing pseudouridine 

The influence of * residues on global helical structure was 
investigated by measuring CD spectra for aU the duplexes 
listed in Table 1. Spectra at 10°C for some of the duplexes 
with internal ^^ are shown in Figure 3 (see also Supporting 
Information). The spectra have shapes consistent with 
A-form RNA helixes (46,47), although some shifting and 
changing of intensity of peaks, shifting of crossover points 
and appearance of some spectral shoulders were observed, 
consistent with hterature reports (3,5,7,8,17). CD spectra 
of duplexes with U instead of * exhibited similar variety. 
Evidently, internal substitution of ^ for U has httle effect 
on global conformation. 

Surprisingly, substitution of * for U as a 5'-dangling 
end diminished by ca. half the molar ellipticity of the 
positive peak at 266 nm and reduced even more the 
intensity of the negative peak at 209 nm (Supporting 
Information). The CD spectra with and without a 
5'-danghng U were essentially identical. When ^I* or U 
was placed as a 3'- dangling end, the CD spectra were 
nearly identical. Less intense CD spectra were also 
observed when ^' replaced the 5'U in U-A, U-G and 
U-U pairs. All the spectra with 5'-terminal >!' had 
ellipticity of less than or ~1 near 265 nm. Most other 
spectra had ellipticities between 1 and 2 in this region. 
No duplexes with elHpticity >2 near 265 nm had internal 
or 5' terminal The CD spectra did not reveal any 
structural reasons for the thermodynamic results. 

DISCUSSION 

It has been shown that replacement of U with ^I* 
thermodynamically stabilizes RNA duplexes (3,8,9,18). 
Usually stronger stacking of ^ in comparison to U was 
postulated as the major origin of enhancement of stability 
(3). Recently, Znosko's team quantified the effect for ^'-A 
pairs relative to U-A pairs by determination of nearest- 
neighbor parameters for ^^-A on the basis of optical 
melting of 24 duplexes (19). They found that on average 
at internal and terminal positions, substitution of ^'-A for 
U-A enhanced duplex stability by 1.7 and l.Okcal/mol, 
respectively. As shown in Table 1, the nearest-neighbor 
parameters for 4'-A and for Watson-Crick pairs predict 
within 1 kcal/mol the measured AGjy's for five of the six 
duplexes with ^I'-A pairs, when experimental error is 
considered. The exception has a central motif of 5'A*I'U/ 
3'UAA and is predicted to be more stable than measured 
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Figure 3. CD spectra of RNA duplexes, (a) Spectra of duplexes 5'UCAGMCAGU/3'AGUCNGUCA containing at central position U-A, *-A, *-G, 
W-U and *-C, (b) Spectra of duplexes containing at central position ip-A surrounded by various 5'- and 3'-adjacent base pairs. 



by 3kcal/mol at 37°C. This duplex contains the two ^-A 
nearest neighbors with parameters that differ most from 
those of U-A, suggesting an unusual context dependence 
for stabilities of 5'A*/3'UA and 5'*A/3'AU nearest 
neighbors. 



Tables 1-3 present for the first time thermodynamic 
effects of substituting ^I* for U to give ^-G, *-U and 
4'-C pairs. Such pairs are often found in pseudouridylated 
RNA duplexes, particularly of splicesomal snRNAs (13). 
Considering experimental error, substituting * for U in 
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these mispairs can enhance stabiHty between ~0 and 
1 kcal/mol, with an average of 0.5kcal/niol (Tables 2 
and 3). The largest stabilization after 4'-A is for ^'-G, 
whereas *I'-C has little effect. For the six duplexes with 
*I'-G, enhancements of stabilities relative to U-G 
duplexes range between 0 and 1.4 kcal/mol (Tables 2 and 
3). These initial results imply that nearest-neighbor 
parameters for ^-G pairs will be dependent on the 
neighbor, as already known for Watson-Crick (24), *I'-A 
(19) and U-G (43) pairs. 

NMR studies of duplexes with internal 4'-A, *I'-G and 
*I'-U pairs revealed hydrogen bonding via the N3 imino 
proton. The spectra are consistent with ^^ and U hydrogen 
bonding to A, G and U in similar ways (Figure 2). 
Resonances are also seen for 4'NIH proton in duplexes 
with a *-A pair, suggesting 4'NIH may be involved in a 
weak hydrogen bond. In contrast, no imino proton 
resonances were observed for ^-C, implying a lack of 
hydrogen bonding. This correlates with >I'-C duplexes 
being the least stable. CD spectra are consistent with 
expectations from NMR spectra in that the overall 
global conformation is A-form with or without 
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